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CHIMERIC MHC PROTEIN AND OLIGOMER THEREOF 

REFERENCE TO RELATED APPLICATIONS 
[0001] This application is a continuation-in-part of PCT Application PCT/EP03/09056, filed 
August 14, 2003, the text of which is in English. This application may be considered re- 
lated to co-pending, co-owned U.S. Pat. Application No. [Attorney Docket Number S- 

845] , filed [contemporaneously herewith] . This application also may be consid- 
ered related to co-pending, co-owned U.S. Pat. Application No. [Attorney Docket Num- 
ber S-846] , filed [contemporaneously herewith] . 

FIELD OF THE INVENTION 
[0002] The present invention relates to a chimeric MHC protein, an expression cassette en- 
coding the same, a vector, an oligomer of said chimeric protein, a method of labeling, de- 
tecting and separating mammalian T cells according to the specificity of their antigen recep- 
tor by use of the oligomer, and suitable primers for constructing said chimeric protein. 

BACKGROUND OF THE INVENTION 
[0003] Major Histocompatibility Complex (MHC) molecules, which are found on the cell 
surface in tissues, play an important role in presenting cellular antigens in the form of short 
linear peptides to T cells by interacting with T cell receptors (TCRs) present on the surface 
of T cells. 

[0004] It has been established that isolated or recombinant forms of MHC-peptide molecules 
are useful for detecting, separating and manipulating T cells according to the specific pep- 
tide antigens these T cells recognize. It has also been understood that the interaction be- 
tween MHC molecules and TCRs across cell surfaces is multimeric in nature and that the 
affinity of a single MHC molecule for a given TCR is generally low in affinity. 
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[0005] As a consequence, there has been an effort to develop multimeric forms of isolated 
or recombinant MHC-peptide molecules to make such molecules more useful in the applica- 
tions described above. 

[0006] European Patent Application EP 812 331 discloses a multimeric binding complex for 
labeling, detecting and separating mammalian T cells according to their antigen receptor 
specificity, the complex having the formula (a-p-P)n, wherein (a-p-P) is an MHC peptide 
molecule, n is > 2, a comprises an a chain of a MHC class I or MHC class II molecule, p 
comprises a p chain of an MHC protein and P is a substantially homogeneous peptide anti- 
gen. The MHC peptide molecule is multimerised by biotinylating the C terminus of one of 
the a or p chain of the MHC molecule and coupling of MHC monomers to tetravalent strep- 
tavidin/avidin or by providing a chimeric protein of an MHC molecule which is modified at 
the C terminus of one of the a or p chain to comprise an epitope which is recognised by a 
corresponding antibody that serves as a multimerising entity. The document further teaches 
use of the MHC oligomers for detecting, labeling and separating specific T cells according 
to their TCR specificity. 

[0007] European Patent Application EP 665 289 discloses specific peptides, MHC mole- 
cules binding these peptides, and oligomers obtained by crosslinking of the respective MHC 
molecules having the specific peptide bound to them. Oligomerisation is achieved by using 
chemical crosslinking agents or by providing MHC chimeric proteins comprising an epi- 
tope, which is recognised by an immunoglobulin such as IgG or IgM. The MHC molecules 
may comprise a label and may be used for labeling, detecting, and separating T cells ac- 
cording to their specific receptor binding, and may eventually be employed in therapy of 
humans. 

[0008] WO 93/10220 discloses a chimeric MHC molecule, comprising the soluble part of 
an MHC molecule, which can be either class I or class II MHC, fused to an immunoglobu- 
lin constant region. The MHC portion of the molecule comprises complementary a and/or p 
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chains and a peptide is bound in the respective binding grooves of the MHC molecules. Due 
to the presence of the dimeric immunoglobulin scaffold these chimeric MHC-Ig molecules 
undergo self-assembly into a dimeric structure. 

[0009] In other research the oligomerisation domain of cartilage oligomeric matrix protein 
(COMP) has been used as a tool for multimerising several proteins in the past. COMP has 
been described and characterised by Efimov and colleagues (see e.g. Proteins: Structure, 
Function, and Genetics 24:259-262 (1996)). COMP is a pentameric glycoprotein of the 
thrombospondin family. Self-assembly of the protein to form pentamers is achieved through 
the formation of a five-stranded helical bundle that involves 64 N-terminal amino acid resi- 
dues of the protein. The amino acid sequence of the oligomerisation domain has been dis- 
closed by Efimov et al, FEBS Letters 341:54-58 (1994). 

[0010] WO 00/44908 discloses chimeric proteins that contain anti-angiogenic portions of 
TSP-1, TSP-2, endostatin, angiostatin, platelet factor 4 or prolactin linked to a portion of 
the N-terminal region of human cartilage oligomeric matrix protein (COMP) thus allowing 
for the formation of pentamers. The document is predominantly concerned with exploiting 
the anti-angiogenic effect mediated by the resulting chimeric proteins. According to this 
disclosure the chimeric protein should promote correct folding of the TSP-domains con- 
tained therein, so that they better mimic the natural proteins than peptides that are based on 
the TSR sequence. 

[0011] US 6,218,513 discloses globins containing non-naturally occurring binding domains 
for creating oligomers of said globins. The COMP oligomerisation domain is one of the 
disclosed binding domains. The advantage seen from oligomerisation relates to increased 
half-life and hence better resistance against intravasal degradation as well as reduced ex- 
travasation of the oligomerised globin proteins, due to their increased size compared to 
monomeric globin proteins. 
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[0012] Holler et al, Journal of Immunological Methods 237:159-173 (2000), disclose the 
development of improved soluble inhibitors of FasL and CD40L based on oligomerised re- 
ceptors comprising TNF-receptor family members fused to the constant region of IgG or the 
self-assembling domain of COMP. It is concluded there that increased affinity of oligomeric 
soluble chimeric receptors of the TNF-receptor family is not a general phenomenon. It is 
found that the affinity of such oligomeric chimeric receptors to their ligand depends on the 
specific receptor-ligand pair under consideration and this is shown to vary significantly even 
between closely related proteins. 

[0013] The attempts for multimerisation of MHC proteins described hereinbefore pose sev- 
eral disadvantages. Chemical crosslinking for example typically results in a non-predictable 
structure of the final MHC oligomer, which may vary considerably for each complex. 
Hence, binding to the target may vary likewise, depending on the final oligomer structure. 
This in turn can impede accuracy and reliability of any assay system the oligomers are used 
in. In the worst case chemical crosslinking may even prevent formation of a functional 
MHC oligomer altogether. 

[0014] Fusing either one or two of the MHC polypeptide chains with the constant region of 
an immunoglobulin molecule such as described in WO 93/10220 results in a dimeric MHC 
molecular complex. Although the dimeric interaction can contribute to increasing the affin- 
ity of the complex, further multimerisation through anti-idiotypic antibodies or protein A or 
G may be required to reach the affinity required for various applications, such as detecting 
antigen specific T cells or activating such cells successfully. Such two-stage multimerisation 
processes can, however, be time-consuming to carry out and it is difficult to control the 
uniformity of the final product. 



[0015] Multimerisation of the MHC monomers by use of non-human binding partners such 
as the biotin/streptavidin binding pair or non-human antibodies on the other hand introduces 
non-human protein components into the oligomer. This raises concerns with regard to po- 
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tential toxicity of the complex and/or immune responses against this non-human part of the 
complex in applications where in vivo use is envisaged, e.g. in human therapy. Accordingly 
it would be desirable to avoid non-human portions in the oligomeric complex. 

[0016] In addition producing MHC multimers that rely on the biotin-streptavidin interaction 
involves a substantial number of process steps, including several rounds of protein purifica- 
tion, and a biotinylation reaction that can lead to significant loss of active material. Further, 
controlling the biotinylation efficiency of monomeric MHC subunits and quality of the final 
multimeric product is difficult. 

[0017] The MHC oligomers available from the prior art provide for a certain enhancement 
of affinity of the complex when compared to the MHC monomer itself. A further increase 
in affinity would, however, be very desirable without increasing the complexity of the syn- 
thesis of the complexes while assuring that such synthesis will yield molecules with high 
uniformity. 

[0018] It is the object of the present invention to overcome the above drawbacks and other 
disadvantages of the prior art and to provide a multimeric MHC peptide complex that is 
easy to manufacture with a uniformly high valency of MHC peptide components that are 
available simultaneously for binding to T cell receptors and a protein sequence that mini- 
mizes non-human content. 
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SUMMARY OF THE INVENTION 
[0019] The above disadvantages and drawbacks of the prior art regarding the formation of 
oligomeric MHC complexes of MHC monomers are overcome by using at least two chi- 
meric proteins comprising a first section derived from an MHC peptide chain or a functional 
part thereof and a second section comprising an oligomerisation domain derived from an 
oligomer-forming coiled-coil protein to form an oligomeric MHC complex. 

[0020] A first aspect of the present invention therefore concerns an oligomeric MHC com- 
plex comprising at least two chimeric proteins, said chimeric proteins comprising a first 
section derived from an MHC peptide chain or a functional part thereof and a second sec- 
tion comprising an oligomerising domain derived from an oligomer-forming coiled-coil pro- 
tein, wherein formation of the oligomeric MHC complex occurs by oligomerisation at the 
oligomerising domain of the chimeric proteins, and wherein at least two of the first sections 
of all chimeric proteins comprised in said oligomer are derived from the same MHC peptide 
chain. 

[0021] Preferably the first section of the chimeric proteins is derived from the extra-cellular 
part of the MHC class I or II a chain. Alternatively, the first section of the chimeric pro- 
teins may be derived from the extra-cellular part of the MHC class I or II p chain. 

[0022] In a preferred embodiment of the invention the oligomerising domain comprised in 
the second section of the chimeric proteins is derived from the pentamerisation domain of 
the human cartilage oligomeric matrix protein (COMP). More preferably, the pentamerisa- 
tion domain of COMP comprised in the second section of the chimeric proteins comprises 
and preferably consists of the amino acids 1 to 128, preferably 20 to 83, most preferably 20 
to 72 of said protein. 

[0023] Preferably, the chimeric proteins further comprise a first linker between the MHC 
peptide chain (first section) and the oligomerising domain (second section). 
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[0024] Preferably, at least one of the chimeric proteins in the oligomer further comprises 
one or more domains selected from the group consisting of a second linker, a tagging do- 
main and a purification domain. 

[0025] Another embodiment of the invention concerns an oligomeric MHC complex as de- 
scribed above that further comprises the complementary MHC peptide chain to at least two 
of the chimeric proteins to form functional MHC binding complexes. 

[0026] Preferably the oligomeric MHC complex according to the first aspect of the inven- 
tion comprises peptide bound to the MHC portions of the complex in the groove formed by 
the MHC al and a2 domains for class I complexes or the MHC ocl and pi domains for 
class II complexes. Preferably the peptide is substantially homogeneous. 

[0027] The oligomeric MHC complex according to the invention may comprise a label. 
Preferably, the label is selected from the group consisting of a light detectable label, a ra- 
dioactive label, an enzyme, an epitope, a lectin, or biotin. 

[0028] In a second aspect the invention concerns a chimeric protein comprising a first sec- 
tion derived from an MHC peptide chain or a functional part thereof and a second section 
comprising an oligomerising domain derived from an oligomer-forming coiled-coil protein 
which coiled-coil protein oligomerises by alignment of at least two substantially identical 
versions of the polypeptide chain from which the oligomerising domain is derived. Prefera- 
bly the oligomerising domain derived from the pentamerisation domain of COMP. More 
preferably, the oligomerising domain comprises and preferably consists of the amino acids 1 
to 128, preferably 20 to 83, most preferably 20 to 72 of COMP. 

[0029] A third aspect of the invention concerns a recombinant expression cassette compris- 
ing a promoter sequence operably linked to a nucleotide sequence coding for a chimeric 
protein for incorporation into the oligomeric MHC complex of the invention as described 
above. 
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[0030] A fourth aspect of the invention concerns a vector comprising the recombinant 
expression cassette described above. 

[0031] A fifth aspect of the invention concerns a pharmaceutical or diagnostic composition, 
comprising an oligomeric MHC complex of the invention. 

[0032] A sixth aspect of the invention concerns a method of labeling and or detecting 
mammalian T cells according to the specificity of their antigen receptor, the method com- 
prising 

(i) combining an oligomeric MHC complex according to the invention and a suspen- 
sion or biological sample comprising T cells, and 

(ii) detecting the presence of specific binding of said complex and the T cells. 

[0033] A seventh aspect of the invention concerns a method of separating mammalian T 
cells according to the specificity of their antigen receptor, the method comprising 
combining an oligomeric MHC complex according to the invention and a suspension or bio- 
logical sample comprising T cells, and 
separating T cells bound to said complex from unbound cells. 

[0034] In a further aspect thereof the invention also pertains to primers useful in construct- 
ing the chimeric peptides described above. 

DESCRIPTION OF THE DRAWINGS 
[0035] Fig.l is a schematic drawing of an oligomeric MHC class I complex of the inven- 
tion. The figure shows two out of several subunits of the fully assembled oligomeric class I 
chimeric complex. As shown p2-microgobulin (P2m) is fused to the coiled oligomerisation 
domain (COMP) of an oligomer-forming coiled-coil protein and spaced from this domain by 
a first linker (LI). A further linker (L2) is provided at the C terminal end of the domain, 
followed by a recognition sequence (BP) for biotinylation by biotin-protein ligase BirA and 
finally a tagging/purification domain (TD) for purification and detection. A biotin molecule 

(bt), attached to the Lysine residue of the recognition sequence in form of a biotinylation 
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peptide (BP) is also shown. The complementary MHC class I a chain having the domains 
ocl, a2, and oc3, is assembled with p2m and antigenic peptide (P). Disulphide bonds (S-S), 
stabilizing the MHC portion of the complex are indicated. Further disulphide bonds are also 
shown, illustrating the specific case where the oligomerising domain is derived from 
COMP, which will assemble into a stable pentamer, wherein these disulphide bonds link 
each pair of the COMP domains in the pentamer. The amino and carboxyl termini of the 
respective a and p chains in the complex are indicated by N and C, respectively. 

[0036] Fig. 2 is a schematic drawing of an oligomeric MHC class II complex of the inven- 
tion. The figure generally follows the nomenclature of Fig. 1 with the difference that ((52m) 
is replaced by the MHC class II a chain, having the domains al, a2, fused to the oligomer- 
ising domain via the linker (LI). Here the a chain is assembled with its complementary 
MHC class II p chain, having the domains pi, p2. 

[0037] Fig. 3 shows a set of schematic maps, including restriction sites, illustrating (a) 
pETBMC04: the p2m-COMP expression construct, as well as (b)-(d) pETBMC01-03: p2m- 
COMP intermediate molecular cloning constructs, and finally (e) pETA2sol: the HLA- 
A*0201 a chain expression construct, as described in detail later on. Stop codons are indi- 
cated by the '*' symbol for each construct in (a)-(e). 

DETAILED DESCRIPTION OF THE INVENTION 
[0038] The oligomeric MHC complex of the invention allows for overcoming the disadvan- 
tages and drawbacks of the prior art. More specifically, the inventors have found that the 
above oligomeric MHC complex can have a substantially higher affinity to a T cell receptor 
than an oligomer obtained by e.g. tetramerising the MHC complexes by coupling through 
biotin and streptavidin. Without wishing to be bound by theory it is believed that this in- 
crease in affinity is achieved when three or more MHC molecules are arranged substantially 
in the same plane with all binding faces oriented in the same direction. In contrast, because 
the tetrameric streptavidin-coupled complex has a tetrahedral arrangement, at best only 
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three MHC binding domains are available simultaneously for contacting the T cell surface. 
The complex according to the invention further allows for effective labeling while minimiz- 
ing the interference of the chosen label with the active MHC domains, as the label(s) are 
located on the opposite end of the oligomer-forming coiled-coil domain. 

[0039] The meaning of an " oligomer-forming coiled-coil protein" within the scope of the 
present invention is a protein comprising an oligomerising domain. Said domains comprise 
two or more polypeptide subunits that may or may not be identical. The subunit of two or 
more of such oligomerisation domains assemble with one another such that each subunit in 
the oligomerising domain assumes a substantially helical conformation in the assembled 
state, wherein the subunits in the oligomerisation domain are arranged along an identifiable 
axis, wherein the oligomerising domain has two identifiable opposite ends along said axis, 
and wherein at least two of the polypeptide subunits have the same amino-to-carboxyl orien- 
tation along said axis. Corresponding oligomerisation domains and oligomer-forming coiled- 
coil proteins are known in the art as e.g. cited in the introductory portion hereof. 

[0040] The oligomerising domain may be derived from a suitable oligomer-forming coiled- 
coil protein of any species. Preferably, the MHC molecule is fused to an oligomerisation 
domain of a human version oligomer-forming coiled-coil protein, in which case unwanted 
immune responses and/or rejection reactions are minimised in situations where the complex 
is to be administered to humans. 

[0041] Examples for oligomer-forming coiled-coil proteins include various types of colla- 
gen, triple coiled-coil domains of C-type lectins, such as mannose binding protein (MBP); 
Clq, myosin, leucine zippers such those occurring in p53, GCN4, bacteriophage P22 Mnt 
repressor; and the trombospondin family proteins such as COMP. Preferably the oligomeri- 
sation domain is derived from the cartilage oligomeric matrix protein COMP. More pref- 
erably, the MHC molecule is fused to an oligomerisation domain of the human version of 
COMP for the reasons described above. 
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[0042] The number of chimeric proteins (n) comprised on the oligomeric MHC complex of 
the invention will typically depend on the type of oligomerisation domain the second section 
of the chimeric proteins is derived from and can in general be 2 or more, preferably n = 2 
to 10, most preferably n = 3 or 5. If the second section of the chimeric proteins to be oli- 
gomerised is e.g. derived from the pentamerisation domain of the COMP this number will 
typically be five such that the oligomer will be a pentamer (n = 5), whereas in case these 
oligomerisation domains are derived from collagen this number will be three (n = 3). 

[0043] The first aspect of the invention therefore relates to an oligomeric MHC complex 
comprising chimeric proteins comprising a first section derived from an MHC peptide chain 
or a functional part thereof and a second section comprising an oligomerisation domain of 
an oligomer-forming coiled-coil protein. Although the MHC peptide chain of the first sec- 
tion preferably forms the N terminal part of the chimeric protein, whereas the oligomerising 
domain of the second section is located in the C terminal region thereof, both sections can 
in general be in any order. The preferred arrangement will, however, allow the MHC part 
to maintain its functional characteristics for all cases. 

[0044] The term "chimeric protein" as used herein means a single peptide protein, the 
amino acid sequence of which is derived at least in part from two different naturally occur- 
ring proteins or protein chain sections, in this case an MHC peptide and a peptide substan- 
tially comprising at least a significant proportion of an oligomerising domain. With the term 
"a functional part thereof as used herein, a part of a peptide chain is meant, which still 
exhibits the desired functional characteristics of the full-length peptide it is derived from. 
For MHC therefore a peptide chain is meant, which, if necessary, when completed or as- 
sembled with the complementary MHC peptide chain thereof, is capable of binding an anti- 
genic peptide. With the term "complementary" MHC peptide chain the respective other 
peptide chain of a naturally occurring MHC complex is meant. The complementary chain to 
an a chain of the MHC complex is the p chain and vice versa. 
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[0045] According to a first embodiment the MHC peptide chain in the chimeric proteins is 
the extra-cellular part of the MHC class I or II a chain. According to another embodiment 
the MHC peptide chain is the extra-cellular part of an MHC class I or II p chain. The MHC 
proteins may be from any vertebrate species, e.g. primate species, particularly humans; 
rodents, including mice, rats, hamsters, and rabbits; equines, bovines, canines, felines; etc. 
Of particular interest are the human HLA proteins, and the murine H-2 proteins. Included in 
the HLA proteins are the class II subunits HLA-DPot, HLA-DPp, HLA-DQot, HLA-DQP, 
HLA-DRot and HLA-DRp, and the class I proteins HLA- A, HLA-B, HLA-C, and p2 - 
microglobulin. Included in the murine H-2 subunits are the class I H-2K, H-2D, H-2L, and 
the class II I-Aa, I-Ap, I-Ea and I-Ep, and p2-microglobulin. Amino acid sequences of 
some representative MHC proteins are referenced in EP 812 331. 

[0046] In a preferred embodiment, the MHC peptide chains correspond to the soluble form 
of the normally membrane-bound protein. For class I subunits, the soluble form is derived 
from the native form by deletion of the transmembrane and cytoplasmic domains. For class 
I proteins, the soluble form will include the al, a2 and oc3 domains of the a chain. For 
class II proteins the soluble form will include the al and a2 or pi and p2 domains of the a 
chain or p chain, respectively. 

[0047] Not more than about 10, usually not more than about 5, preferably none of the 
amino acids of the transmembrane domain will be included. The deletion may extend as 
much as about 10 amino acids into the a3 domain. Preferably none of the amino acids of 
the a3 domain will be deleted. The deletion will be such that it does not interfere with the 
ability of the a3 domain to fold into a functional disulfide bonded structure. The class I p 
chain, p2m, lacks a transmembrane domain in its native form, and does not have to be trun- 
cated. Generally, no class II subunits will be used in conjunction with class I subunits. 
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[0048] The above deletion is likewise applicable to class II subunits. It may extend as much 
as about 10 amino acids into the ot2 or 02 domain, preferably none of the amino acids of the 
ot2 or p2 domain will be deleted. The deletion will be such that it does not interfere with the 
ability of the a2 or p2 domain to fold into a functional disulfide bonded structure. 

[0049] One may wish to introduce a small number of amino acids at the polypeptide ter- 
mini, usually not more than 25, more usually not more than 20. The deletion or insertion of 
amino acids will usually be as a result of the requirements in cloning, e.g. as a consequence 
of providing for convenient restriction sites or the like, and to manage potential steric prob- 
lems in the assembly of the molecules. In addition, one may wish to substitute one or more 
amino acids with a different amino acid for similar reasons, usually not substituting more 
than about five amino acids in any one domain. 

[0050] According to the present invention the MHC peptide chain comprised in the first 
section of a chimeric protein as defined above is fused, preferably at its C terminal end, to 
the oligomerisation domain of an oligomer-forming coiled-coil protein. Where the pen- 
tamerising domain of COMP is used as the oligomerising domain, this domain comprises, 
more preferably consists of the N terminal amino acids 1 to 128, preferably 20 to 83, most 
preferably 20 to 72 of said protein as discussed in the prior art section of this invention dis- 
closure. With regard to numbering of amino acids, reference is made to Efimov et al., 
FEBS Letters 341:54-58 (1994). Further, similar to the MHC part of the chimeric protein of 
the invention, this domain can be altered by amino acid substitution, deletion or insertion, 
as long as the self-assembly of the oligomerising domain is not impaired. 

[0051] Fusion of both peptide chains, MHC and the oligomerising domain, respectively, 
may be direct or, as is shown in Figures 1 and 2, may include a first linker (LI) connecting 
and spacing apart the MHC peptide and the oligomerising domain (OD) in the chimeric pro- 
tein. Joining by use of a linker is preferred. In general such linker will comprise not more 



13 



Attny. Docket No. S-844-US 



than 30, preferably not more than 26 amino acids. Suitable linkers are known in the art and 
include e.g. immunoglobulin hinge regions, or serine glycine repeat sequences. 

[0052] As is further shown in Figures 1 and 2, the chimeric protein of the invention may 
further include, preferably at its C terminal end, one or more of a second linker (L2), a 
biotinylation peptide (BP) and a tagging domain and/or a purification domain (TD) in either 
order. In a preferred embodiment the chimeric protein of the invention includes all three of 
the above domains in this order. In general the second linker will comprise not more than 
25, preferably not more than 20 amino acids and can be the same as detailed for the first 
linker above. 

[0053] The tagging domain optionally included in the chimeric protein of the invention can 
be any domain which allows for labeling of the protein. Preferably the tagging domain in- 
cludes a label. This label can be included in the domain itself such as an epitope recognised 
by an antibody or a light detectable or radioactive label. Preferably, the label is selected 
from the group consisting of fluorescent markers, such as such as FITC, phycobiliproteins, 
such as R- or B-phycoerythrin, allophycocyanin, Cy3, Cy5, Cy7, a luminescent marker, a 
radioactive label such as 125 I or 32 P, an enzyme such as horseradish peroxidase, or alkaline 
phosphatase e.g. alkaline shrimp phosphatase, an epitope, a lectin or biotin/streptavidin. 

[0054] Where the label is itself a protein, the polypeptide chain of the protein used for label- 
ing can be fused to the chimeric protein, preferably at its C terminus, to the label protein. 
For example a fluorescent protein such as a green fluorescent protein (GFP), or a subunit of 
a phycobiliprotein could be used in this chimeric. GFP chimeric protein technology is well 
known in the art. Chimeric proteins comprising a suitable domain from a phycobiliprotein is 
described, for example in WO 01/46395. 



[0055] Alternatively the label may be attached at a specific attachment site provided in the 
tagging domain. For example, the tagging domain may include a recognition site for the 

14 



Attny. Docket No. S-844-US 



biotin protein ligase BirA to allow for site-specific biotinylation of the tagging domain and 
hence recognition by streptavidin/avidin. Suitable recognition sequences for BirA are de- 
scribed in EP 711 303. Similarly a lectin may be attached, or any other site-specific enzy- 
matic modification may be made by means of incorporating an amino acid recognition se- 
quence for a modifying enzyme into the amino acid sequence of the chimeric proteins of the 
invention. Other possible types of enzymatic modification are described in EP812 331. Al- 
ternatively labeling can be achieved by binding of a suitably labeled antibody, or antibody 
fragment, such as a labeled F(ab) fragment. 

[0056] The purification domain optionally to be included in the chimeric protein of the in- 
vention can be any domain assisting in purification of the protein of the invention e.g. by 
providing specific binding characteristics. Appropriate sequences are known to the skilled 
worker and can be applied as long as they do not interfere with the functional MHC and 
oligomerising domains of the chimeric protein. Preferably the purification domain is a 
hexahistidine sequence. 

[0057] According to the first aspect thereof the present invention relates to an oligomeric 
MHC complex formed by oligomerising through self-assembly of the oligomerising domain 
of an oligomer-forming coiled-coil protein in the second section of the above chimeric pro- 
teins. Oligomerisation of the oligomerising domain typically occurs spontaneously and re- 
sults in stable oligomers of the chimeric proteins. Typically, the oligomeric complex will 
consist of several identical monomeric subunits (at least in their MHC part, depending on 
the oligomerisation domain). If desired, however, two or more chimeric proteins differing 
in their MHC portion may be admixed before oligomerisation. Thereby heterogeneous oli- 
gomers may be obtained. Generally, however, at least two of the chimeric proteins in each 
complex will comprise a first section derived from the same MHC a or p subunit. For n > 
2 the oligomer may contain at least two of the chimeric proteins in each complex which 
comprise a first section derived from the same MHC molecule. 
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[0058] Preferably the oligomerising domain of COMP is used and results in spontaneous 
assembly of stable pentamers of the chimeric protein. 

[0059] The oligomeric MHC complex may further comprise the complementary MHC pep- 
tide chain(s) to form functional MHC binding complexes as discussed above and may also 
comprise a peptide bound in the groove formed by the MHC ctl and a2 domains for class I 
MHC complexes or the MHC ccl and pi domains for class II MHC complexes. Preferably 
the peptide is substantially homogeneous. 

[0060] The antigenic peptides will be from about 6 to 14 amino acids in length for com- 
plexes with class I MHC proteins, and usually about 8 to 11 amino acids. The peptides will 
be from about 6 to 35 amino acids in length for complexes with class II MHC proteins, and 
usually from about 10 to 20 amino acids. The peptides may have a sequence derived from a 
wide variety of proteins. In many cases it will be desirable to use peptides, which act as T 
cell epitopes. The epitope sequences from a number of antigens are known in the art. Alter- 
natively, the epitope sequence may be empirically determined by isolating and sequencing 
peptides bound to native MHC proteins, by synthesis of a series of putative antigenic pep- 
tides from the target sequence, then assaying for T cell reactivity to the different peptides, 
or by producing a series of MHC-peptide complexes with these peptides and quantification 
of their T cell binding. Preparation of peptides, including synthetic peptide synthesis, identi- 
fying sequences, and identifying relevant minimal antigenic sequences is known in the art. 
In any case, the peptide comprised in the oligomeric MHC complex is preferably substan- 
tially homogeneous, meaning that preferably at least 80% of the peptides are identical, more 
preferably at least 90 % and most preferably at least 95%. 

[0061] In a second aspect the present invention relates to a chimeric protein itself, which 
can be oligomerised into the oligomeric MHC complex of the invention. The chimeric pro- 
tein comprises a first section derived from an MHC peptide chain or a functional part 
thereof and a second section comprising an oligomerising domain derived from an oligomer- 
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forming coiled-coil protein which coiled-coil protein oligomerises by alignment of at least 
two substantially identical versions of the polypeptide chain from which the oligomerising 
domain is derived. In other words, the oligomer-forming coiled-coil protein used to derive 
the oligomerising domain in the chimeric protein of the present invention may be a homo- 
oligomer or a hetero-oligomer. Typically, however, it will comprise two or more copies of 
substantially identical polypeptide chain sections in its oligomerising domain that are aligned 
when the oligomer is formed. Preferably oligomerising domain derived from the pentameri- 
sation domain of COMP. Preferred embodiments are as discussed above with regard to the 
oligomeric MHC complex itself. 

[0062] In a third aspect thereof the present invention relates to a recombinant expression 
cassette comprising a promoter sequence operably linked to a first nucleotide sequence cod- 
ing for an MHC peptide chain or a functional part thereof and a second nucleotide sequence 
encoding the oligomerising domain of COMP. In a first embodiment the first nucleotide 
sequence encodes the MHC peptide chain of the extra-cellular part of the MHC class I or II 
a chain. In a second embodiment the first nucleic acid sequence encodes the MHC peptide 
chain of the extra-cellular part of the MHC class I or II 0 chain. Preferably the second nu- 
cleotide sequence encodes amino acids 1 to 128, preferably 20 to 83, most preferably 20 to 
72 of COMP. Preferably the COMP sequence is derived from human COMP. 

[0063] Generally, the nomenclature used herein and the laboratory procedures in recombi- 
nant DNA technology described are those well known and commonly employed in the art. 
Standard techniques are used for DNA and RNA isolation, amplification, and cloning. Gen- 
erally enzymatic reactions involving DNA ligase, DNA polymerase, restriction endonucle- 
ases and the like are performed according to the manufacturer's specifications, using en- 
zyme buffers supplied by the manufacturer. These techniques and various other techniques 
are generally performed as known in the art. Suitable nucleotide sequences for the MHC 
and the oligomer-forming coiled coil domains such as the COMP domain are known in the 
art as discussed hereinbefore. 
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[0064] The DNA constructs will typically include an expression control DNA sequence, 
including naturally-associated or heterologous promoter regions, operably linked to protein 
coding sequences. The expression cassette may further include appropriate start and stop 
codons, leader sequences coding sequences and so on, depending on the chosen host. The 
expression cassette can be incorporated into a vector suitable to transform the chosen host 
and/or to maintain stable expression in said host. 

[0065] The term "operably linked" as used herein refers to linkage of a promoter upstream 
from one or more DNA sequences such that the promoter mediates transcription of the 
DNA sequences. Preferably, the expression control sequences will be those eukaryotic or 
non-eukaryotic promoter systems in vectors capable of transforming or transfecting desired 
eukaryotic or non-eukaryotic host cells. Once the vector has been incorporated into the ap- 
propriate host, the host is maintained under conditions suitable for high-level expression of 
the nucleotide sequences, and the collection and purification of the chimeric proteins. 

[0066] MHC-DNA sequences and DNA sequences for suitable oligomerising domains from 
an oligomer-forming coiled-coil protein, including that of COMP, can be isolated in accor- 
dance with well-known procedures from a variety of human or other cells. Traditionally, 
desired sequences are amplified from a suitable cDNA library, which is prepared from mes- 
senger RNA that is isolated from appropriate cell lines. Suitable source cells for the DNA 
sequences and host cells for expression and secretion can be obtained from a number of 
sources, such as the American Type Culture Collection (ATCC) or other commercial sup- 
pliers. 

[0067] The nucleotide sequences or expression cassettes used to transfect the host cells can 
be modified according to standard techniques to yield chimeric molecules with a variety of 
desired properties. The molecules of the present invention can be readily designed and 
manufactured utilizing various recombinant DNA techniques well known to those skilled in 
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the art. For example, the chains can vary from the naturally occurring sequence at the pri- 
mary structure level by amino acid insertions, substitutions, deletions, and the like. These 
modifications can be used in a number of combinations to produce the final modified protein 
chain. In general, modifications of the genes encoding the chimeric molecule may be readily 
accomplished by a variety of well-known techniques, such as site-directed mutagenesis. 

[0068] The amino acid sequence variants as discussed above can be prepared with various 
objectives in mind, including increasing the affinity of the molecule for. target T cells, in- 
creasing or decreasing the affinity of the molecule for the respective CD4 or CD8 co- 
receptors interacting with class I or class II MHC complexes, for facilitating purification 
and preparation of the chimeric molecule or for increasing the stability of the complex, in 
vivo and ex vivo. The variants will, however, typically exhibit the same or similar biological 
activity as naturally occurring MHC molecules and retain and retain their oligomerising 
property of the corresponding naturally occurring oligomerising domain of the relevant oli- 
gomer-forming coiled-coil protein, respectively. 

[0069] In addition, preparation of the oligomer is a straightforward process. For therapeutic 
uses this may be carried out in mammalian cell culture, e.g. in CHO, COS, or human cell 
lines. Alternatively other expression systems can be used for expressing the molecules of 
the present invention, such as insect cell culture, including baculovirus and Drosophila 
melanogaster expression systems, yeast expression systems, or prokaryotic expression sys- 
tems such as Escherichia coli (E. coli). If expression is performed in a prokaryotic expres- 
sion system, either soluble expression, i.e. directed into the cytoplasm or periplasm, or in- 
soluble expression, i.e. into inclusion bodies is possible. 

[0070] A typical vector for E. coli would be one of the pET family of vectors which com- 
bine efficient expression from bacteriophage T7 RNA polymerase within an inducible lac 
operon-based system. The vectors containing the nucleotide segments of interest can be 
transferred into the host cell by well-known methods, depending on the type of cellular host. 
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For example, transformation into chemical- or electro-competent cells is commonly utilised 
for prokaryotic cells, whereas calcium phosphate treatment, cationic liposomes, or electro- 
poration may be used for other cellular hosts. Other methods used to transfect mammalian 
cells include the use of Polybrene, protoplast chimeric, microprojectiles and microinjection. 

[0071] The chimeric protein and the complementary MHC a or p subunits, respectively, 
may be co-expressed and assembled as oligomeric complexes in the same cell. Alternatively 
these two components may be produced separately and allowed to associate in vitro in the 
presence of the antigenic peptide of choice to form stable oligomeric MHC peptide com- 
plexes. In the latter case mixing and association of the two components may occur either 
before or after oligomerisation of the chimeric peptide 

[0072] The advantage of association of separate components in vitro is that oligomeric 
MHC-peptide complexes can be obtained with very high peptide homogeneity, e.g. greater 
than 95% or even greater than 99%. Where the complexes are expressed as fully assembled 
molecules, the peptide of interest can be introduced into the complexes, either by culturing 
the expressing cells with medium containing the antigenic peptide of interest, or by ex- 
changing the peptide of interest with peptides that have endogenously bound during expres- 
sion in vitro, which is typically done by incubating the purified complex with excess peptide 
at low or high pH so as to open the antigen binding pocket (see e.g. WO 93/10220). The 
antigenic peptide can also be covalently bound using standard procedures such as photoaf- 
finity labeling (see e.g. WO 93/10220). 

[0073] Alternatively, the peptide may be directly linked or fused to the expression construct 
of the chimeric protein or its complementary a or p chain. A suitable linker between the 
peptide portion and the N terminus of the chimeric protein or its complementary a or p 
chain, such as a polyglycine repeat sequence may be provided. Including the peptide as a 
chimeric peptide in the expression construct will allow for expression and folding of the 
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complete MHC-peptide oligomeric complex without further addition of antigenic peptide or 
peptide exchange. 

[0074] Conditions that permit folding and association of the subunits and chimeric proteins 
in vitro are known in the art. Assembly of class I MHC peptide complexes as well as as- 
sembly of functional class II complexes is e.g. described in EP 812 331. As one example of 
permissive conditions, roughly equimolar amounts of solubilised a and p subunits are mixed 
in a solution of urea in the presence of an excess of antigenic peptide of interest. In the case 
of class II MHC molecules refolding is initiated by dilution or dialysis into a buffered solu- 
tion without urea. Peptides are loaded into empty class II heterodimers at about pH 5 to 5.5 
over about 1 to 3 days, followed by neutralization, concentration and buffer exchange. Oli- 
gomerisation of the complex should occur simultaneously with formation of the a - p - pep- 
tide complex. Alternatively a pentameric a or p complex may be achieved in two stages: 
first allowing oligomerisation of the oligomerising domain, followed by further refolding 
with the complementary a or p subunit in the presence of peptide according to the methods 
described above. 

[0075] The assembled complex of the invention, or, initially the separate chimeric protein 
of the present invention and the complementary a or p subunit, can be purified according to 
standard procedures of the art, including ammonium sulfate precipitation, gel electrophore- 
sis, column chromatography, including gel filtration chromatography, ion exchange chroma- 
tography, hydrophobic interaction chromatography, affinity chromatography, and the like. 

[0076] The resulting peptide chain or oligomeric MHC complex of the present invention 
may be used therapeutically or in developing and performing assay procedures, immuno- 
stainings, and the like. Therapeutic uses of the oligomeric MHC complexes of the present 
invention require identification of the MHC haplotypes and antigens useful in treating a par- 
ticular disease. In addition it may be necessary to determine the tissue types of patients be- 
fore therapeutic use of the complexes according to the present invention, which is, however, 
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a standard procedure well known in the art. The present invention is particularly suitable for 
treatment of cancers, infectious diseases, autoimmune diseases, and prevention of transplant 
rejection. Based on knowledge of the pathogenesis of such disease and the results of studies 
in relevant animal models, one skilled in the art can identify and isolate the MHC haplotype 
and antigens associated with a variety of such diseases. 

[0077] In a fifth aspect the present invention therefore relates to a pharmaceutical or diag- 
nostic composition, comprising the above oligomeric MHC complexes as defined above. 
Pharmaceutical compositions comprising the proteins are useful for, e.g. parenteral admini- 
stration, i.e. subcutaneously, intramuscularly or intravenously. In addition, a number of 
new drug delivery approaches are being developed. The pharmaceutical compositions of the 
present invention are suitable for administration using these new methods, as well. 

[0078] The compositions for parenteral administration will commonly comprise a solution 
of the chimeric protein or pentamer thereof dissolved in an acceptable carrier, preferably an 
aqueous carrier. A variety of aqueous carriers can be used, e.g. buffered water, 0.4 % sa- 
line, 0.3 % glycine and the like. These solutions are sterile and generally free of particulate 
matter. These compositions may be sterilised by conventional, well-known sterilization 
techniques. The compositions may contain pharmaceutically acceptable auxiliary substances 
as required to approximate physiological conditions such as pH adjusting and buffering 
agents, toxicity adjusting agents and the like, for example sodium acetate, sodium chloride, 
potassium chloride, calcium chloride, sodium lactate, etc. The concentration of the chimeric 
protein in these formulations can vary widely, i.e. from less than about 1 pg/ml, usually at 
least about 0. 1 mg/ml to as much as 10 - 100 mg/ml and will be selected primarily based on 
fluid volumes, viscosities, etc. in accordance with the particular mode of administration 
selected. 



[0079] A typical pharmaceutical composition for intramuscular injection could be made up 
to contain 1 ml sterile buffered water, and 0.1 mg of oligomer complex protein. A typical 
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composition for intravenous infusion could be made up to contain 250 ml of sterile Ringer's 
solution, and 10 mg of oligomer complex protein. Actual methods for preparing parenter- 
ally administrable compositions will be known or apparent to those skilled in the art. 

[0080] The chimeric proteins or oligomeric MHC complexes of this invention can be lyophi- 
lised for storage and reconstituted in a suitable carrier prior to use. This technique has been 
shown to be effective and commonly used lyophilization and reconstitution techniques can be 
employed. It will be appreciated by those skilled in the art that lyophilization and reconstitu- 
tion can lead to varying degrees of activity loss and that use levels may have to be adjusted to 
compensate. 

[0081] Detecting or separating the T cells according to the specificity of their antigen recep- 
tor, as described hereinbefore, may involve any known technique. Suitable techniques are 
e.g. disclosed in EP 812 331, which is incorporated by reference herein. More in detail, the 
oligomeric MHC complexes of the invention can be used in labeling and detection of anti- 
gen specific T cells in suspension or other biological samples, such as in tissue samples, or 
may be used for separation of T cells e.g. when bound or immobilised on substrates, includ- 
ing paramagnetic or other beads as known to a skilled worker, or by flow cytometry. 

[0082] Hence the complexes of the present invention can be used for example to purify and 
enrich T cells that are specific for a particular antigen and re-introduce these T cells to a 
patient autologously after expansion and/or other manipulation in cell culture in various dis- 
ease situations. Alternatively T cells could be selectively depleted, e.g. in the case of an 
autoimmune disorder or other unwanted T cell immune responses. 

[0083] Due to their enhanced affinity for specific T cell receptors, the oligomeric MHC 
complexes according to the present invention can allow for improved drug potency, better 
serum half-life and also improved pharmacokinetics, the latter presumably being due to in- 
creased molecular mass. 
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[0084] Due to the possibility of expressing the chimeric proteins and complementary MHC 
a or p portions comprised in the oligomeric MHC complexes of the invention in non- 
eukaryotic systems they are further easy to make at high yields, and can be expressed as 
sub-components, which can subsequently be refolded with one another in the presence of a 
homogeneous population of antigenic peptide. 

[0085] These and other advantages are available to the skilled worker from the foregoing 
description. The following examples are given for the purpose of illustration only and shall 
not be construed as limiting the present invention in any way. 

EXAMPLES 

[0086] The following is a detailed example for cloning, expressing, and purifying a pen- 
tameric class I MHC complex, which comprises a chimeric fusion of p2m with COMP. The 
chimeric p2m-COMP protein is expressed in insoluble inclusion bodies in E. coli and sub- 
sequently assembled as pentameric p2m-COMP in vitro. The pentameric class I MHC pep- 
tide complex is then formed in a second refolding reaction by combining p2m-COMP pen- 
tamers and the human MHC class I a molecule known as HLA-A*0201, in the presence of 
an appropriate synthetic binding peptide representing the T cell antigen. In this example, a 
well characterized antigen derived from Epstein-Barr virus BMLF1 protein, GLCTLVAML 
(a.a. 289-297) [SEQ ID NO: 1], is used. The resultant complex is labelled with a fluores- 
cent entity and used as a staining reagent for detecting antigen-specific T cells from a mixed 
lymphocyte population, in a flow cytometry application. 

Molecular cloning of the (32m-COMP construct 

[0087] The strategy involves the sequential cloning into pET-24c vector of p2m, yielding a 
construct referred to as pETBMCOl, followed by the insertion of the oligomer isation do- 
main of cartilage oligomeric matrix protein (COMP) with a biotin acceptor sequence (BP) 
for site-specific biotinylation with the biotin-protein ligase BirA, yielding a construct re- 
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ferred to as pETBMC02. Thirdly a polyglycine linker is cloned in between p2m and 
COMP, yielding a construct referred to as pETBMC03, and finally, a serine-residue is re- 
moved by site-directed mutagenesis, which serine residue precedes the poly-glycine linker, 
to give the final p2m-COMP/pET-24c construct, referred to as pETBMC04 (see also Figure 
3). Removal of the serine residue is carried out to avoid steric hindrance when the P2m 
molecule is associated with the MHC class 1 0 chain protein. 

Cloning fi2m into the pET-24c vector 
Source of 32m 

[0088] The extracellular portion of P2m comprises of 99 amino acids (equivalent to Ilel- 
Met99 of the mature protein) encoded by 74 bp-370 bp of the DNA sequence. This region 
of the P2m sequence is amplified from a normal human lymphocyte cDNA library, by po- 
lymerase chain reaction (PCR) using the primers BMC#1 [SEQ ID NO: 2] (1 uM) and 
BMC#2 [SEQ ID NO: 3] (1 \\M) which incorporate Ndel and BamHl restriction sites, 
respectively. (Sigma-Genosys, see Appendix I for primer sequences). Amplification is 
carried out using a proof-reading DNA polymerase (Pfic, Invitrogen) (1.25 U) in Pfic 
amplification buffer (as supplied by the manufacturer), supplemented with 2 mM MgS04 
and 0.2 mM dNTPs following standard thermal cycling conditions. Step 1: 94°C, 4 min; 
Step 2: 94°C, 1 min; Step 3: 55°C, 1 min; Step 4: 72°C, 30 sec; Step 5: cycle to Step 2, 
34x, Step 6: 72°C 10 min. 

Restriction enzyme digestion of P2m and pET-24c 

[0089] p2m PCR product is purified from the above reaction mix using a QIAquick* PCR 
purification kit according to the manufacturer's instructions (Qiagen). 200 ng of purified 
PCR product and 1 ug pET-24c vector (Novagen) are each digested with BamH I (10 U) 
and Nde I (10 U) restriction enzymes (New England Biolabs, NEB) for 4 h at 37°C, in ac- 
cordance with the manufacturer's instructions. The digested fragments are separated by 
horizontal gel electrophoresis in 1 % agarose (Bioline), and purified using a QIAEX* II gel 
extraction kit, according to the manufacturer's instructions (Qiagen). 
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Ligation of 32m into pET-24c (pETBMCOl) 

[0090] The gel-purified insert and vector DNA are ligated at a 1:3 molar ratio (vec- 
toninsert, 50 ng: 7.5 ng) using T4 DNA ligase (5 U; Bioline), in T4 DNA ligase buffer (as 
supplied) for 16 hrs at 16°C. 

Transformation of pETBMCOl into E. coli host strain XLl-Blue 

[0091] The ligation mixtures and appropriate controls are subsequently transformed into 
XLl-Blue strain competent E. coli cells, according to the manufacturer's instructions 
(Stratagene). Successful transformants are selected by plating the cells on Luria-Bertani 
(LB) agar plates containing 30 ^ig/ml kanamycin, and incubating overnight at 37°C. 

PCR screening of transformants 

[0092] A selection of single colonies from the bacterial transformation plates are screened 
by PCR with T7 promoter [SEQ ID NO: 4] (1 jiM) and T7 terminator [SEQ ID NO: 5] (1 
I^M) primers (Sigma Genosys, see Appendix I for primer sequences), which are comple- 
mentary to regions of the pET vector flanking the cloning site. Amplification is carried out 
using Taq DNA polymerase (1 U, Bioline) in Taq reaction buffer (as supplied), supple- 
mented with 2 mM MgS04 and 0.2 mM dNTPs, using 25 thermal-cycling reactions as de- 
tailed above. Successful transformants generated a DNA fragment of approximately 500 bp, 
ascertained by 1.5 % agarose gel electrophoresis. 

Plasmid DNA preparation 

[0095] Bacterial transformants that generated the correct size of PCR products are inocu- 
lated into 6 ml of sterile LB-kanamycin medium and incubated overnight at 37 °C with 200 
rpm shaking. pETBMCOl plasmid DNA is recovered from the bacterial cultures using a 
QIAprep* Spin Mini-prep kit according to the manufacturer's instructions (Qiagen). The 
presence of the [32m fragment in these plasmids is further verified by automated DNA se- 
quencing. 
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[0096] A schematic illustration of pETBMCOl is shown in Figure 3(b), specifically illustrat- 
ing restriction sites. 

Cloning COMP-BP into pETBMCOl 
Synthesis of the COMP-BP cassette 

[0097] The sequence of the oligomerisation domain of COMP is obtained from the Genbank 
database (accession #1705995) and a region encoding the coiled-coil domain (amino acids 
21-85) is selected based on self-association experiments of COMP (Efimov et al, FEBS 
Letters 341:54-58 (1994)). A biotin acceptor sequence 'BP': SLNDIFEAQKIEWHE [SEQ 
ID NO: 6] is incorporated at the C terminus and an additional 14 amino acid linker, 
PQPQPKPQPKPEPET [SEQ ID NO:7] is included to provide a physical separation between 
the COMP oligomerising domain and BP. 

[0098] The whole region is synthesized using the overlapping complementary oligonucleo- 
tides (BMC#3, BMC#4, BMC#5, BMC#6, BMC#7 and BMC#8) [SEQ ID NOS: 8, 9, 10, 
11, 12 and 13] in a 'PCR assembly' reaction. Oligonucleotide sequences are detailed in Ap- 
pendix I and are synthesised and purified ('PAGE-pure') by Sigma-Genosys. The primers 
(4.8 pmoles) are phosphorylated with T4 kinase (10 U, Gibco) for 1 hour at 37°C, in for- 
ward reaction buffer (as supplied), supplemented with 1 mM ATP. The reaction is termi- 
nated by heating to 80°C for 15 min and then allowed to cool to room temperature, which 
enabled oligonucleotide annealling. The annealed oligonucleotides are ligated together using 
T4 DNA ligase as in Section 1.1, and ~ 10 ng used in a PCR to Till in' the ends of the se- 
quences and^to amplify the synthetic gene. The PCR reaction is essentially as described in 
Section 1.1, except BMC#3 [SEQ ID NO: 8] (15 uM) and BMC#8 [SEQ ID NO: 13] (15 
|iM) are used as the forward and reverse primers in the following thermal cycling condi- 
tions: Step 1: 95 °C, 4 min; Step 2: 95°C, 1 min; Step 3: 70°C, 1 min; Step 4: 72°C, 2 
min; Step 5: cycle to Step 2, 15x; Step 6: 72°C, 10 min. A PCR product of the correct size 
is gel purified as detailed previously (Section 1.1). 
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Insertion of COMP-BP cassette into pETBMCOl to produce pETBMC02 
[0099] 200 ng of purified COMP-BP and 1 ng pETBMCOl vector are digested for 4 hrs at 
37°C using Hind III (10 U) and Xho I (10 U) restriction enzymes (NEB), as described in 
Section 1.1. The digestion products are purified, ligated, transformed and PCR screened as 
in Section 1.1. Plasmids positive from the screen are purified and sequenced as described in 
Section 1.1. 

[095] A schematic diagram of pETBMC02 is shown in Figure 3(c), illustrating restriction 
sites. The section encoding COMP-BP is out-of-frame in this construct. 

Cloning the poly-glycine linker into pETBMC02 
Synthesis of the poly-glycine linker 

[096] The poly-glycine linker is synthesized by annealing overlapping oligonucleotides 
BMC#9 [SEQ ID NO: 14] and BMC#10 [SEQ ID NO: 15] (Sigma-Genosys, 'PAGE-pure', 
Appendix I). The poly-glycine linker comprises the following motif in single letter amino 
acid code: GS(GGGGS) 4 GGK [SEQ ID NO: 16], when encoded in the correct reading 
frame. The first two residues are encoded by the BamH I restriction site, whereas the last 
residue is encoded by the Hind III restriction site. Since the nucleotide sequence of the po- 
lyGlycine linker only incorporates the 5' overhang of the cut BamH I restriction site, and 
the 3' overhang of the cut Hind III nucleotide recognition motifs, there is no need to digest 
the annealed product to produce the complementary single-stranded overhangs suitable for 
subsequent ligation. The oligonucleotides are phosphorylated and annealed as described in 
Section 1.2. 

Insertion of the poly-glycine linker into pETBMC02 to produce pETBMC03 

[097] pETBMC02 is digested with Bamli I (10U) and Hind III (10 U) . Ligation of the 

annealed poly-glycine linker into pETBMC02 was as described previously (Section 1.1), 
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assuming 96 fmoles of annealed oligonucleotide/^. The transformation and PCR-screening 
reactions are as described in Section 1.1, but in addition, the presence of an inserted linker 
is verified by a restriction enzyme digestion of the PCR screen product to ascertain the 
presence or absence of a Sal I restriction site. Successful transformants are not susceptible 
to Sal I digestion, given the removal of the site from the plasmid vector backbone. Purifica- 
tion of pETBMC03 and automated sequencing is as described in Section 1.1. 

[098] A schematic diagram of pETBMC03 is shown in Figure 3(d), illustrating restriction 
sites. The BamR I site encodes a serine residue immediately before the poly-glycine se- 
quence. This serine residue is ultimately removed to give pETBMC04, as illustrated in 3(a). 

Deletion of a serine residue immediately prior to the poly-glycine linker 
Site-directed mutagenesis of pETBMC03 

[099] Analysis of X-ray crystallography models of MHC class I molecules reveal that the 
C terminus of p2m closely abuts the a3 domain of the a chain. It is therefore desirable to 
achieve maximum flexibility at the start of the poly-glycine linker, by removal of the serine 
residue encoded by the BamU I restriction motif, resulting in a linker comprising 
G(GGGGS>GGK [SEQ ID NO: 17]. PCR primers BMC#11 [SEQ ID NO: 18] and 
BMC#12 [SEQ ID NO: 19] ('PAGE-pure', Appendix I) are obtained, which enabled re- 
moval of the TCC triplet that encodes serine within the Bam HI recognition motif, using a 
site-directed mutagenesis kit ('QuickChange', Stratagene) strictly in accordance with the 
manufacturer's instructions. Corrected versions of this construct are verified by automated 
sequencing (Section 1.1) and designated pETBMC04. 

[0100] A schematic illustration of pETBMC04 is shown in Figure 3(a). Specifically this 
figure shows a schematic diagram of the completed p2m-COMP construct in pET24c 
(pETBMC04), illustrating the restriction sites. 

Molecular cloning of the HLA-A*0201 a chain construct 
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Cloning A*0201 a chain into the pET-lld vector 

[0101] The extracellular portion of HLA-A*0201 a chain (EMBL M84379) comprises of 
276 amino acids (equivalent to Glyl-Pro276 of the mature protein) encoded by bases 73-900 
of the messenger RNA sequence. This region of the A*0201 sequence is amplified from a 
normal human lymphocyte cDNA library by PCR, using the primers A2S#1 [SEQ ID NO: 
20] and A2S#2 [SEQ ID NO: 21] which incorporated Ncol and BamHl restriction sites re- 
spectively. The procedure for cloning the A*0201 insert into Nco HBamR I-digested pET- 
lld vector (Novagen) is essentially as described for p2m in Section 1.1. 

[0102] The resulting plasmid is named pETA2sol, and a schematic diagram is shown in Fig- 
ure 3(e), illustrating restriction sites. 

Expression of p2m-COMP and HLA-A*0201 a chain proteins 

[0103] An identical procedure is carried out to produce either p2m-COMP or A*0201 a 
chain proteins. Plasmid DNA is transformed into an E. coli expression host strain in prepa- 
ration for a large scale bacterial prep. Protein is produced as insoluble inclusion bodies 
within the bacterial cells, and is recovered by sonication. Purified inclusion bodies are solu- 
bilised in denaturing buffer and stored at -80°C until required. Unless specified, chemical 
reagents are obtained from Sigma. 

Large scale production of recombinant proteins 

Transformation of pETBMC04 and pETA2sol into E. coli host strain BL21(DE3)pLysS 
[0104] Purified plasmid DNA is transformed into the BL21(DE3)pLysS E. coli strain, 
which carries a chromosomal copy of the T7 RNA polymerase required to drive protein 
expression from pET-based constructs. Transformations into BL21(DE3)pLysS competent 
cells (Stratagene) are carried out with appropriate controls, according to the manufacturer's 
instructions, using 0.5jag purified plasmid DNA for each reaction. Successful transformants 
are selected on LB-agar plates containing 34 ug/ml chloramphenicol, in addition to kanamy- 
cin 30ug/ml (pETBMC04) or 100|xg/ml ampicillin (pETA2sol). 
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Growth of E. coli cultures 

[0105] A single bacterial transformant colony is innoculated into 60ml sterile LB medium, 
containing appropriate antibiotics for selection, and left to stand overnight in a warm room 
(~24°C). The resulting overnight culture is added to 6 litres of LB and grown at 37°C with 
shaking (-240 rpm), up to mid-log phase (ODeoo = 0.3-0.4). Protein expression is induced 
at this stage by addition of 1.0 ml of 1M IPTG to each flask. The cultures are left for a fur- 
ther 4 h at 37°C with shaking, after which the cells are harvested by centrifugation and the 
supernatant discarded. 

Sonication and recovery of proteins 

[0106] The bacterial cell pellet is resuspended in ice-cold balanced salt solution and soni- 
cated (XL series sonicator; Misonix Inc., USA) in a small glass beaker on ice in order to 
lyse the cells and release the protein inclusion bodies. Once the cells are completely lysed 
the inclusion bodies are spun down in 50 ml polycarbonate Oak Ridge centrifuge tubes in a 
Beckman high-speed centrifuge (72 series) at 15,000 rpm for 10 min. The inclusion bodies 
are then washed three times in chilled Triton* wash buffer (0.5% Triton* X-100, 50 mM 
Tris, pH 8.0, 100 mM sodium chloride, 0.1% sodium azide and freshly added 2 mM dithio- 
threitol [DTT]), using a hand-held glass homogeniser to resuspend the pellet after each 
wash. This is followed by a final wash in detergent-free wash buffer. 

[0107] The resultant purified protein preparation is solubilised in 20-50 ml of 8 M urea 
buffer, containing 50 mM MES, pH 6.5, 0.1 mM EDTA and 1 mM DTT, and left on an 
end-over-end rotator overnight at 4°C. Insoluble particles are removed by centrifugation and 
the protein yield is determined using Bradford's protein assay reagent (Bio-Rad Laborato- 
ries) and by comparison with known standards. The quality of expression and purification is 
also verified by vertical electrophoresis of samples on a 15% sodium dodecyl sulphate- 
polyacrylamide gel (SDS-PAGE). Typical protein yields are 25-75 mg per litre of LB cul- 
ture. The p2m-COMP and A*0201 a chain proteins are approximately 27 (kD) and 35 kD 
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in size, respectively. Urea-solubilised protein is dispensed in lOmg aliquots and stored at - 
80°C for future use. 

Formation of pentameric ft2m-COMP complexes 
Assembly ofP2m-COMP into pentamers 

[0108] Assembly of p2m-COMP from the urea-solubilised inclusion bodies is performed by 
diluting the protein into 20 mM CAPS buffer, pH 11.0, containing 0.2 M sodium chloride 
and 1 mM EDTA, to give a final protein concentration of 1.5 mg/ml. The protein is oxi- 
dised at room temperature by addition of oxidised and reduced glutathione to final concen- 
trations of 20 mM and 2 mM, respectively. Following an overnight incubation, disulphide 
bond formation is analysed by non-reducing SDS-PAGE on 10 % bis-tricine gels (Invitro- 
gen). 

[0109] The protein mixture is subsequently buffer exchanged into 20 mM Tris, pH 8.0, 50 
mM sodium chloride ('S200 buffer'), using Centricon Plus-20 ultra-centrifugal concentra- 
tors (10 kD molecular weight cut-off, Milllipore) according to the manufacturer's instruc- 
tions, and concentrated to a final volume of 4.5 ml, in preparation for enzymatic biotinyla- 
tion with BirA (Affinity, Denver, Colorado). 0.5 ml of lOx BirA reaction buffer (as sup- 
plied) is added, and recombinant BirA enzyme at 10 yiM. final concentration, supplemented 
with 10 mM ATP, pH 7.0. A selection of protease inhibitors is also used to preserve the 
proteins: 0.2 mM PMSF, 2 ug/ml pepstatin and 2 ug/ml leupeptin. The reaction is left for 4 
hours at room temperature. 

Purification of biotinylated fi2m-COMP pentamers 
[0110] Biotinylated p2m-COMP is purified by size exclusion chromatography (SEC) on a 
Superdex®200 HR 26/60 column (Amersham Biosciences), running S200 buffer. The col- 
umn is previously calibrated and used according to the manufacturer's instructions and frac- 
tions containing pentamers in the region of 135 kD are collected. Protein recovery is deter- 



32 



Attny. Docket No. S-844-US 



mined using the Bradford Assay, and biotinylation efficiency is determined by use of HABA 
reagent (Pierce), according to manufacturer's instructions. 

Formation of refolded HLA-A*0201/GLCTLVAML complexes 
Refolding HLA-A*0201 a chain with biotinylated p2m-COMP and peptide 
[0111] 500 ml of refolding buffer is prepared as follows: 100 mM Tris, pH 8.0, 400 mM L- 
arginine hydrochloride, 2 mM EDTA, 5 mM reduced glutathione and 0.5 mM oxidised glu- 
tathione, dissolved in deionised water and left stirring at 4°C. 15 mg of lyophilised synthetic 
peptide GLCTLVAML is dissolved in 0.5 ml dimethylsulfoxide and added to the refolding 
buffer whilst stirring. 50 mg of biotinylated pentameric p2m-COMP and 30 mg of A*0201 
a chain is added sequentially, injected through a 23gauge hypodermic needle directly into 
the vigorously-stirred buffer, to ensure adequate dispersion. The refolding mixture is then 
left stirring gently for 16 hours at 4°C. 

Purification of refolded pentameric HLA-A*0201 /GLCTLVAML complexes 
[0112] The protein refolding mixture is subsequently concentrated from 500 ml to 20 ml 
using a MiniKros hollow fibre ultrafiltration cartridge (Spectrum Labs, Rancho Dominguez, 
California) with a 30 kD molecular weight cutoff. Further concentration of the complex 
from 20 ml to 5 ml is carried out in Centricon Plus-20 centrifugal concentrators (30 kD mo- 
lecular weight cut-off) according to the manufacturers instructions, followed by buffer ex- 
change into S200 buffer using disposable PD10 desalting columns (Amersham Biosciences), 
according to the manufacturer's instructions. Final volume is 7.5 ml. The concentrated pro- 
tein refold mixture is first purified by SEC on a Superdex® 200 HR 26/60 gel filtration 
chromatography column, as in Section 4.2. Fractions containing protein complexes in the 
region of 310 kD is collected. 

[0113] Fractions collected from SEC are pooled and subjected to further purification by 
anion exchange chromatography on a MonoQ® HR 5/5 column (Amersham Biosciences), 
running a salt gradient from 0 - 0.5 M sodium chloride in 20 mM Tris over 15 column vol- 
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umes. The dominant peak is collected. Protein recovery is determined using the Bradford 
assay. 

Fluorescent labelling of purified complexes 

[0114] Since each streptavidin molecule is able to bind up to 4 biotin entities, final labelling 
with phycoerythrin (PE)-conjugated streptavidin is carried out in a molar ratio of 1:0.8, 
streptavidin to biotinylated pentamer complex respectively, taking into account the initial 
biotinylation efficiency measurement made for p2m-COMP in Section 4.2. The total re- 
quired amount of pentamer complex is subdivided (e.g. into 5 equal amounts) and titrated 
successively into streptavidin-PE. The concentration of A*0201 pentamer-streptavidin com- 
plex is adjusted to 1 mg/ml with phosphate buffered saline (PBS), supplemented with 0.01 % 
azide and 1 % BSA. This resultant fluorescent reagent is stored at 4°C. 

Staining of antigen-specific T cells 

[0115] Peripheral blood lymphocytes (PBL) are obtained by drawing a blood sample from a 
healthy, A*0201-positive, EBV carrier who had previously been shown to elicit strong cyto- 
toxic T cell responses to the GLCTLVAML antigen. The PBL are isolated by density gradi- 

® 

ent centrifugation on Ficoll (Amersham Biosciences), washed in PBS solution and counted. 

1-2 x 10 6 lymphocytes are allocated for each staining condition and the cells are washed 
once in 1 ml wash buffer (0.1% sodium azide, 0.1% BSA in PBS) and then resuspended in 
50 id of wash buffer. 1 fig of PE-labelled A *0201 /GLCTLVAML pentamer used to stain a 
single aliquot of cells, in combination with an anti-CD8 antibody (Dako), to identify the 
cytotoxic T cell population. The suspension is incubated on ice for at least 30 minutes in the 
dark, after which the cells are washed twice in wash buffer and stored in fix solution in the 
dark (1% foetal calf serum, 2.5% formaldehyde in PBS). Thereafter samples are analysed 
on a Becton-Dickinson FACScan™ flow cytometer, using CellQuest software, according to 
standard methods well known in the art. 
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[0116] Thus the present invention provides at least the following non-limiting, exemplary 
embodiments: 

A. An oligomeric MHC complex comprising at least two chimeric proteins, said chi- 
meric proteins comprising a first section derived from an MHC peptide chain or a functional 
part thereof and a second section comprising an oligomerising domain derived from an oli- 
gomer-forming coiled-coil protein, wherein formation of the oligomeric MHC complex oc- 
curs by oligomerisation at the oligomerising domain of the chimeric proteins, and wherein 
at least two of the first sections are derived from the same MHC peptide chain. 

B. An oligomeric MHC complex of A wherein the first section of the chimeric proteins 
is derived from the extra-cellular part of the MHC class I or II a chain. 

C An oligomeric MHC complex of any one of the preceding listed embodiments 
wherein the first section of the chimeric proteins is derived from the extra-cellular part of 
the MHC class I or II p chain. 

D. An oligomeric MHC complex of any one of the preceding listed embodiments 
wherein the oligomerising domain comprised in the second section in at least one of the 
chimeric proteins is derived from the pentamerisation domain of the human cartilage oligo- 
meric matrix protein (COMP). 

E. An oligomeric MHC complex of D wherein the pentamerisation domain of COMP 
comprised in the second section in at least one of the chimeric proteins comprises and pref- 
erably consists of the amino acids 1 to 128, preferably 20 to 83, most preferably 20 to 72 of 
COMP. 

F. An oligomeric MHC complex of any one of the preceding listed embodiments 
wherein at least one of the chimeric proteins further comprises a first linker between the 
MHC peptide chain and the oligomerising domain. 
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G. An oligomeric MHC complex of any one of the preceding listed embodiments 
wherein at least one of the chimeric proteins further comprises one or more domains se- 
lected from the group consisting of a second linker, a tagging domain and a purification 
domain. 

H. An oligomeric MHC complex according to any one of the preceding listed embodi- 
ments further comprising the complementary MHC peptide chains to at least two of the 
chimeric proteins to form functional MHC binding complexes. 

I. An oligomeric MHC complex according to H further comprising peptide bound to 
the MHC portions of the complex in the groove formed by the MHC al and a2 domains 
for class I complexes or the MHC a 1 and pi domains for class II complexes. 

J. An oligomeric MHC complex according to I wherein the peptide is substantially ho- 
mogeneous. 

K. An oligomeric MHC complex according to any one of the preceding listed embodi- 
ments comprising a label. 

L. An oligomeric MHC complex according to K wherein the label is selected from the 
group consisting of a light detectable label, a radioactive label, an enzyme, an epitope, a 
lectin, or biotin. 

M. A chimeric protein comprising a first section derived from an MHC peptide chain or 
a functional part thereof and a second section comprising an oligomerising domain derived 
from an oligomer-forming coiled-coil protein which coiled-coil protein oligomerises by 
alignment of at least two substantially identical versions of the polypeptide chain from 
which the oligomerising domain is derived. 

N. A chimeric protein according to M comprising an oligomerising domain derived 
from the pentamerisation domain of the human cartilage oligomeric matrix protein (COMP). 
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O. A chimeric protein according to N wherein the oligomerising domain comprises and 
preferably consists of the amino acids 1 to 128, preferably 20 to 83, most preferably 20 to 
72ofCOMP. 

P. A recombinant expression cassette comprising a promoter sequence operably linked 
to a nucleotide sequence coding for a chimeric protein as defined in any one of M, N and 
O. 

Q. A vector comprising the recombinant expression cassette of P. 

R. A pharmaceutical or diagnostic composition, comprising an oligomeric MHC com- 
plex according to any one of A to L, optionally in combination with a pharmaceutically ac- 
ceptable carrier. 

S. A method of labeling and or detecting mammalian T cells according to the specificity 
of their antigen receptor, the method comprising 

(i) combining an oligomeric MHC complex according to any one of A to L and a 

suspension or biological sample comprising T cells, and 
(ii) detecting the presence of specific binding of said complex and the T cells. 

T. A method of separating mammalian T cells according to the specificity of their anti- 
gen receptor, the method comprising 

(i) combining an oligomeric MHC complex according to any one of A to L and 
a suspension or biological sample comprising T cells, and 
(ii) separating T cells bound to said complex from unbound cells. 

U. A primer consisting of a DNA sequence selected from the group consisting of BMC 
#1 [Seq ID No. 2], BMC #2 [Seq ID No. 3], BMC #3 [Seq ID No. 8], BMC #4 [Seq ID 
No. 9], BMC #5 [Seq ID No. 10], BMC #6 [Seq ID No. 11], BMC #7 [Seq ID No. 12], 
BMC #8 [Seq ID No. 13], BMC #9 [Seq ID No. 14], BMC #10 [Seq ID No. 15], BMC #11 
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[Seq ID No. 18], BMC #12 [Seq ID No. 19], A2S #1 [Seq ID No. 20], and A2S #2 [Seq ID 
No. 21]. 

[0117] Except as may be expressly otherwise indicated, the article "a" or "an" if and as 
used herein is not intended to limit, and should not be construed as limiting, the description 
or a claim to a single element to which the article refers. Rather, the article "a" or "an" if 
and as used herein is intended to cover one or more such elements, unless the text expressly 
indicates otherwise. 

[0118] Each and every patent or other published document referred to in any portion of this 
specification is incorporated in toto into this disclosure by reference, as if fully set forth 
herein. 

[0119] This invention is susceptible to considerable variation within the spirit and scope of 
the appended claims. 
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Appendix I: Table of primers used for synthesis of DNA constructs 



Primer name Sequence Directum Sequence (5'-3') 
ID No. 



BMC#1 



2 Forward GCA TCA CCA TAT GAT CCA GCG TAC TC C AAA 
GATTCAGG 



BMC #2 



3 Reverse CTA CAA GGA TCC C ATG TCT CGA TCC CAC 
Reverse TTA ACT AT 



T7 promoter 



4 Forward TAA TAC GAC TCA CTA TAG GG 



T7 terminator 



Reverse GCT AGT TAT TGC TCA GCG G 



BMC #3 



BMC #4 



BMC #5 



BMC #6 



BMC #7 



BMC #8 



BMC #9 



10 



11 



12 



13 



14 



Forward 



Reverse 



Forward 



Reverse 



Forward 



Reverse 



Forward 



TAA AGC TTC AGG GCC AGA GCC CGT TGG GCT 
CAG ACC TGG GCC CGC AGA TGC TTC GGG 
AAC TGC AGG AAA CCA ACG CGG CG 

GAA CGT GAT CTC CCT GAC CTG CTG CCG CAG 
CAG CTC CCG CAC GTC CTG CAG CGC CGC GTT 
GGT TTC CTG CAG TTC CCG AAG 

CTG CAG GAC GTC CGG GAG CTG CTG CGG 
CAG CAG GTC AGG GAG ATC ACG TTC CTG 
AAA AAC ACG GTG ATG GAG TGT GAC GCG 

TAC GGC CGC ACG CTG GGT AGG CCG GTG CGT 
ACT GAC TGC TGC ATC CCG CAC GCG TCA CAC 
TCC ATC ACC GTG TTT TTC AG 

TGC GGG ATG CAG CAG TCA GTA CGC ACC 
GGC CTA CCC AGC G TAC GGC CGC CGC AGC 
CGC AGC CGA AAC CGC AGC CGA AAC CGG 
AAC CGG AAA CTA GTT TGA ACG ACA TC 

TAC TCG A GT TCG TGC CAT TCG ATT TTC TGA 
GCC TCG AAG ATG TCG TTC AAA CTA GTT TCC 
GGT TCC GG TTT 

CGGCTGCGGTTTCGGCTGCGGCTGC 

GAT CCG GTG GTG GTG GTT CTG GTG GTG GTG 
GTT CTG GTG GTG GTG GTT CTG GTG GTG GTG 
GTT CTG GTG GTA 
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AGC TTA CCA CCA GAA CCA CCA CCA CCA 
BMC #10 15 Reverse GAA CCA CCA CCA CCA GAA CCA CCA CCA 

CCA GAA CCA CCA CCA CCG 

BMC #11 18 Forward GAG ACA TGG GAG GTG GTG GTG G 

BMC #12 19 Reverse CCA CCA CCA CCT CCC ATG TCT C 

A2S#1 20 Forward GCA TCA CCA TGG GTT CTC ACT CTA TGA GGT 

ATT TC 

A2S #2 21 r™>™» GCA TAC GGA TCC TTA CGG CTC CCA TCT CAG 

A2S * 2 21 ReVerS€ GGTGAGG 
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Appendix II: Table of Disclosed Amino Acid Sequences 

Sequence ID No. Amino Acid Sequence 

1 GLCTLVAML 

6 SLNDIFEAQKIEWHE 

7 PQPQPKPQPKPEPET 

16 GS(GGGGS) 4 GGK 

17 G(GGGGS) 4 GGK 
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